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(Dated: March 27, 2009) 

Abstract 

We present a search for the standard model Higgs boson using hadronically decaying tau leptons, 
in 1 fb _1 of data collected with the DO detector at the Fermilab Tevatron pp collider. We select two 
final states: plus missing transverse energy and b jets, and r + r~ plus jets. These final states 
are sensitive to a combination of associated W/Z boson plus Higgs boson, vector boson fusion and 
gluon-gluon fusion production processes. The observed ratio of the combined limit on the Higgs 
production cross section at the 95% C.L. to the standard model expectation is 29 for a Higgs boson 
mass of 115 GeV. 

PACS numbers: 13.85Qk, 13.85.Rm, 14.80Bn 
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A standard model (SM) Higgs boson with a mass in the range 105 — 145 GeV is expected 
to be produced in pp collisions at a center-of-mass energy of 2 TeV with cross sections of 
(9(100 fb) for associated VH production (V = W or Z) and vector boson fusion (VBF), 
qq — > VVq'q" —>■ q'q"H, and of 0(1 pb) for gluon-gluon fusion (GGF) [1]. Previous searches 
for the SM Higgs boson at the Fermilab Tevatron collider [2j have sought the VH processes 
with W/Z decays to leptons other than taus and H — > bb, and the gluon fusion process with 
H — > VV* with V(V*) — > ee or fifi. Thus far, there have been no published searches in the 
case that either the V or H decays to r leptons. Given the small Higgs boson production 
cross sections, it is advantageous to use all possible decay modes to increase the search 
sensitivity. Here, we present a search designed for either of the two final states: r^v^r bb jets 
(denoted "rz/") or r + r~ + jets (denoted "rr"). The analysis is based on 0.94 fb -1 (rz/) and 
1.02 fb -1 (rr) of data collected by the DO experiment [3J at the Fermilab Tevatron collider. 

The tv analysis targets WH production with W — > rz/ and ZH production where Z — > rr 
but one r is not identified, both with H — > bb. The triggers used for selecting events require 
jets of high transverse energy, Et, and large missing transverse energy, fa- The offline 
selection of events requires at least one tau candidate decaying to hadrons, at least two jets 
identified as candidate b quark jets (b tagged) with transverse momentum > 15 GeV, 
and IpT, corrected for the presence of muons and taus, greater than 30 GeV. We reject 
events containing an electron with p? > 15 GeV or a muon with p? > 8 GeV to maintain 
independence from the rr analysis and other SM Higgs boson searches Q]. 

The rr analysis targets VH production with Z — > r + r _ and H — » bb (denoted "HZ"), 
V —> qq and H — > r + r~ ("WH" and "ZH"), VBF with H -> r+r~, and GGF with H -> 
r + r _ and at least two associated jets. We identify one of the taus through its decay to nv T V^ 
and the other in a hadronic decay mode. The events satisfy a combination of single muon 
and muon plus jets trigger conditions. Offline, events are selected j^] by requiring exactly 
one muon with p? > 12 GeV, pseudorapidity \r]\ < 2.0, and isolated from other tracks and 
calorimeter activity in a cone surrounding the muon track candidate. We also require a 
hadronic tau candidate and at least two jets. The r and /i are required to be of opposite 
charge for the primary event sample. Events containing an electron with p? > 12 GeV are 
rejected. 

We identify three types of hadronic taus, motivated by the decays (1) r 1 * 1 — > 7r ± z/, (2) 
r 1 * 1 — > ^tt v, and (3) r ± — > 7r ± 7r ± 7r T (7r°)z/. The identifications [5] are based on the number 
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of associated tracks and activity in the electromagnetic (EM) portion of the calorimeter, 
both within a cone TZ = (Ar/) 2 + (A0) 2 < 0.5, where <\> is the azimuthal angle. The 
requirements for the tv (tt) analysis are: for type 1, a single track with p^, k > 12 (15) 
GeV and no nearby EM energy cluster; for type 2, a single track with > 10 (15) GeV 
with an associated EM cluster, and for type 3, at least one track with > 7 GeV and 
Sp^ k > 20 GeV and an associated EM cluster. In addition to hadronic tau decays, type 2 
taus also contain r —>■ e decays. Due to the larger multijet background, type 3 taus are not 
used in the tv analysis. For the tt channel only those two-track type 3 candidates with both 
tracks of the same charge sign are retained to give unambiguous tau charge determination. 
A neural network (NN) [51] is formed for each tau type using input variables such as isolation 
and the transverse and longitudinal shower profiles of the calorimeter energy depositions 
associated with the tau candidate. Tau preselection is based on the requirement that the 
output NN value, NN T , exceeds 0.3 thus favoring the tau hypothesis. The tau transverse 
momentum p T T is constructed from the transverse energy observed in the calorimeter, E^, 
with type-dependent corrections based on the tracking information. For the three types we 
require p r T to be greater than 12 (15), 10 (15), or (20) GeV for the tv (tt) analyses. The 
tv analysis subdivides the type 2 taus according to whether the energy deposit is electron- 
like or hadron-like and the two subsamples are treated separately in assessing the multijet 
background. For type 2 candidates in the tt analysis, we require 0.7 < p^ k /E^, < 2 to 
remove backgrounds in regions with poor EM calorimetry or due to cosmic rays. 

Jets are reconstructed with a cone of radius 0.5 in rapidity-azimuth space {f|. Their 
energies are corrected to the particle level to account for detector effects and missing energy 
due to semileptonic decays of jet fragmentation products. We preselect jets with p T > 15 
GeV, \r]\ < 2.5, and separated by TZ > 0.5 from r and fi candidates. 

Backgrounds other than those from multijet (MJ) production are simulated using Monte 



Carlo (MC). We use alpgen |7j for it and V^+iets production; pythia [8| for WW, WZ 
and ZZ (diboson) production; and COMPHEP |9[ for single top quark production. The 
alpgen events are passed through pythia for parton showering and hadronization. The 
Higgs boson signal processes are generated using pythia and the CTEQ6L1 [lOj leading 
order parton distribution functions (PDF) for = 105 - 145 GeV in 10 GeV steps. We 



normalize the cross sections to the highest available order calculations for the signal 



and background [121 ] . Higgs decays are simulated using hdecay [13J and for tau decays 
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using tauola [ijj]. All MC events are passed through the standard DO detector simulation, 
digitization, and reconstruction programs. 

Backgrounds due to MJ production, with spurious^ or misidentified taus are estimated 
from data samples. For the tv analysis, an enriched multijet sample is formed by selecting 
taus with 0.3 < NN T < 0.7. The contributions from those background processes generated 
by MC are then subtracted to give the BG TU multijet background sample which has negligible 
Higgs boson signal and provides the shapes of the multijet distributions in the kinematic 
variables. The normalization is given by the ratio of the number of events in the signal 
region, NN T > 0.9, after subtracting MC backgrounds, to the number of events in the BG TJ , 
sample. 

For the MJ background in the rr analysis, we prepare a multijet background data sample 
(BG TT ), orthogonal to the signal sample (SG rr ) defined by the fi, r, and jet preselection 
cuts above, by reversing both track and calorimeter isolation requirements for the muon 
and by requiring NN T < 0.8. For both BG rr and SG TT samples, the MC backgrounds are 
subtracted, and the same sign (SS) or opposite sign (OS) \i - r charge combinations subsets 
are formed. The BG rT sample provides the shape of the multijet background, with the 
normalization obtained by multiplying the number of SS SG Tr events by the ratio of OS to 
SS events in the BG Tr sample. These ratios are determined separately for each tau type, 
and are observed to be close to one and independent of p!^ and p T T . 

The event sample for the tv analysis is obtained with additional requirements after the 
object selections described above: (a) at least two jets with pt > 20 GeV and < 3 jets with 
Pt > 15 GeV; (b) the angle A<p(^T,pr) < 7t ~/2, where /V is the negative of the transverse 
component of the net momentum of all tracks in the event 15) ; (c) < 200 GeV, where 
Ht is the scalar sum of the pt of all jets; (d) for hadron-like type 2 taus, the transverse 
mass, formed from the r and fa, less than 80 GeV; (e) dijet invariant mass in the range 
50 < Mjj < 200 GeV; and (f) the requirement A0(r,$ T ) < 0.02(vr - 2)(fa - 30) + 2 {fain 
GeV) to reduce contamination due to poorly reconstructed multijet events in which a jet 
misidentified as a tau is nearly collinear with fa. To further improve the signal (S) over 
background (B) separation, we require two jets to be tagged with a NN that discriminates b 
quark jets and jets from light partons [16|. Figure l(a,b) shows the Mjj distribution before 
and after b tagging and the event yields are summarized in Table [I] 

Most of the signal processes sought in the rr analysis contain light quark jets, so we do 
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Dijet mass (GeV) Dijet mass (GeV) NN output 



FIG. 1: The dijet mass distribution for all tau types for the rv analysis (a) before 6-tagging, and 
(b) after the final selection; (c) the combined NNzj ets variable for the low Higgs mass rr analysis. 
The signal is shown (multiplied by 30) for Mjj = 115 GeV. (color online) 



not employ b tagging. We require 2 jets with > 20 GeV. To further separate signals from 
backgrounds, we train a dedicated NN for the signal processes (HZ, WH, ZH, VBF) and for 
each of the main background types (W + jets, Z + jets, it and MJ). After requiring two jets, 
the MC GGF samples are small, making NN training unreliable. Since the GGF and VBF 
processes both involve non-resonant dijet systems, we incorporate the GGF events with the 
VBF sample when constructing the final limit analysis. The NNs are separately trained for 
low mass (105, 115 and 125 GeV) and high mass (135, 145 GeV) Higgs bosons, giving 32 NNs 
in all. Twenty well-modeled input variables are considered for each of the NNs. They include 
transverse or invariant masses of combinations of jets and leptons, angular correlations, 
and overall event distributions such as H? and aplanarity 17j . For each signal-background 
pair, a choice of six or seven variables is made using the criterion that each added variable 
must give significant improvement in S/ \I~B. The same variable choices are made for all 
Higgs boson masses. All NN input and output variables show good agreement between data 
and background prediction, and typically provide good discrimination between the signal 
and background under consideration. The it, W+jets and MJ NNs give good separation 
of signal and background, whereas the Z+jets NN signal and background distributions are 
not so well differentiated. Thus we define the variables NN^ g as the largest NN output 
variable among the various signals, for each background source, bg = it, W + jets, and MJ. 
We require NN^ g > 0.4, based on an optimization of the expected Higgs boson cross section 
limits. After this selection, the NN outputs trained against the Z + jets background for all 
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TABLE I: Numbers of events at the preselection level and after the final selection (b tagging for 
tv and NNbg cut for tt) for all r types combined, for data, estimated backgrounds and signal at 
Mh = 115 GeV. The y+jets background is given for light parton ("u,d,s,g" = "lp") and heavy 
flavor ("b,c" = "hf") jets separately. The uncertainties shown are statistical only. For the tv (tt) 
analysis the combined statistical and systematic uncertainties on the sum of backgrounds in the 
final selections are 5.3 (14.8) events. 





tv analysis 


tt analysis 


Source 


Preselection 


Final 


Preselection 


Final 


W+ lp 


110/1-1-18 
11Z4 it lo 


u.o ± u.u 


V7 7-1-9 1 
O I . 1 ± Z.l 


0.1 ± U.O 


W+ hf 


Qno 4- A 8 
OUo.Z ± 4.0 


i n o 4- n q 
iu.y ± u.o 


o.Z ± u.o 


n q 4- n i 
u.y ± u.i 


Z+ lp 


AO 1 4- 1 K 
4y.l ±1.0 


<. u.z 


78 A 4- D Q 

/0.4 ± u.y 


4o.o ± u.o 


Z+ hf 


1 .0 ± U.o 


(\ A inn 
U.4 ± U.U 


1 r 7 _L 1 n 

10. 1 ± l.U 


in 1 i n *7 
1U.1 ± U. / 


it 


46.7 ± 0.4 


9.5 ± 0.1 


30.8 ± 0.3 


2.8 ± 0.0 


Diboson 


54.9 ± 1.1 


0.7 ±0.0 


6.1 ±0.5 


2.1 ±0.2 


Multijet 


122.6 ± 11.2 


1.3 ±0.1 


57.2 ±8.1 


6.5 ±2.8 


Sum 


1714 ± 22 


23.3 ±0.4 


234 ±9 


71.2 ±3.0 


Data 


1666 


13 


220 


58 


HZ 






0.038 


0.029 


WH 


0.543 


0.201 


0.145 


0.106 


ZH 


0.023 


0.015 


0.094 


0.069 


VBF 






0.071 


0.059 


GGF 






0.041 


0.030 


Sum 


0.566 


0.216 


0.389 


0.293 



signals are combined by taking their weighted average, NNzjets, over the four signal processes 
(HZ, WH, ZH, VBF), with weights equal to the relative expected yield for each signal. The 
NNzjets distribution for the final sample is shown in Fig. 1(c), now including the GGF signal 
events. The signal and background event yields are given in Table [H 

Some systematic uncertainties induce a shape dependence on the final limit setting vari- 
able. For the tv analysis, such shape dependence is found for the jet energy scale, jet energy 
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resolution, and the 6-tagging efficiencies. Alternate shapes are determined by changing the 
relevant parameter by ±1 standard deviation from the nominal value and are provided to the 
limit setting program. For the rr analysis, only the multijet background is found to give an 
appreciable shape change. It is determined by varying the method for selecting MJ events, 
reversing either the muon or the tau requirements, but not both, relative to the standard 
choice. The remaining 'flat' systematic uncertainties do not affect the final variable distri- 
bution shape. Such flat uncertainties for the tv (rr) analysis are, unless otherwise noted, 
fully correlated for different backgrounds and analysis channels, and include (a) integrated 
luminosity, 6.1% (6.1%) isj]; (b) trigger efficiency, 5.5% (3%) (uncorrected tv and rr); (c) 
muon identification, (4.5%); (d) tau identification, 5.0-6.0% (5.0%); (e) tau track efficiency, 
3.0% (3.0%); (f) tau energy scale, 2.3-2.7% (3.5%); (g) jet identification and reconstruction, 
1.7-4.9% (2%); (h) jet energy resolution, (4.5%); (i) jet energy scale (7.5%) Q; (j) MC 
background cross sections, 6-18% (6-18%) (these are taken to be uncorrelated among the 
backgrounds); (k) higher order correction for the V^+jets cross section, 20% (20%); (1) V+ 
heavy flavor jet cross section correction, 30% (30%); and (m) multijet background, 82-100% 
(uncorrelated tv and rr). 

The upper limits on the Higgs boson cross section are obtained using the modified fre- 
quentist method [20|. For the tv analysis, the test statistic is the negative log likelihood 
ratio (LLR) derived from the My distribution. For the rr analysis, the LLR is formed from 
the AWzjets final neural network variable. The confidence levels CL s+ b (CL&) give the proba- 
bility that the LLR value from a set of simulated pseudo-experiments under the signal plus 
background (background-only) hypothesis is less likely than that observed, at the quoted 
CL. The hypothesized signal cross sections are scaled up from their SM values until the 
value of CL S = CL s+ b/ CLb reaches 0.05 to obtain the limit cross sections at the 95% C.L., 
both for expected and observed limits. In the calculation, all contributions to the system- 
atic uncertainty are varied, subject to the coustraiuts giveu by their estimated vaiues, to 
give the best fit [21[. The correlations of each systematic uncertainty among signal and/or 
background processes are accounted for in the minimization. 

The ratios of the expected and observed upper limits to the SM expectations are shown 
in Table HT1 for the two channels separately and combined. For all Higgs masses, the observed 
limits are within la of the expected limits. At Mh = 115 GeV, the observed (expected) 95% 
C.L. limit is 29 (28) times that predicted in the SM for the seven signal processes considered 
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TABLE II: Expected and observed 95% C.L. upper limits on the Higgs boson production cross 
section relative to the SM predicted value, for the tv and rr analyses separately and combined. 





TV 


analysis 


TT 


analysis 


Combined 


M H (GeV) 


exp 


obs. 


exp 


obs. 


exp. obs. 


105 


33 


27 


39 


36 


24 20 


115 


42 


35 


43 


47 


28 29 


125 


62 


60 


60 


65 


40 44 


135 


105 


106 


87 


61 


63 50 


145 


226 


211 


158 


95 


120 82 



in the combined tv and rr analyses. This is the first limit on SM Higgs production using 
final states involving hadronically decaying tau leptons. These results contribute to the 
sensitivity of the combined Tevatron search for low mass Higgs bosons 
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